Interactive Visualization and Navigation in Large Data Collections using the Hyperbolic Space

نویسندگان

  • Jörg A. Walter
  • Jörg Ontrup
  • Daniel Wessling
  • Helge J. Ritter
چکیده

We propose the combination of two recently introduced methods for the interactive visual data mining of large collections of data. Both, Hyperbolic Multi-Dimensional Scaling (HMDS) and Hyperbolic Self-Organizing Maps (HSOM) employ the extraordinary advantages of the hyperbolic plane (H2): (i) the underlying space grows exponentially with its radius around each point ideal for embedding high-dimensional (or hierarchical) data; (ii) the Poincaré model of the IH exhibits a fish-eye perspective with a focus area and a context preserving surrounding; (iii) the mouse binding of focus-transfer allows intuitive interactive navigation. The HMDS approach extends multi-dimensional scaling and generates a spatial embedding of the data representing their dissimilarity structure as faithfully as possible. It is very suitable for interactive browsing of data object collections, but calls for batch precomputation for larger collection sizes. The HSOM is an extension of Kohonen’s Self-Organizing Map and generates a partitioning of the data collection assigned to an IH tessellating grid. While the algorithm’s complexity is linear in the collection size, the data browsing is rigidly bound to the underlying grid. By integrating the two approaches we gain the synergetic effect of adding advantages of both. And the hybrid architecture uses consistently the IH visualization and navigation concept. We present the successfully application to a text mining example involving the Reuters-21578 text corpus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

JustClick: Personalized Image Recommendation via Exploratory Search from Large-Scale Flickr Image Collections

In this paper, we have developed a novel framework called JustClick to enable personalized image recommendation via exploratory search from large-scale collections of manuallyannotated Flickr images. First, a topic network is automatically generated to summarize large-scale collections of manuallyannotated Flickr images at a semantic level. Hyperbolic visualization is further used to enable int...

متن کامل

Navigating Large Hierarchical Space Using Invisible Links

To date, many web visualization applications have shown the usefulness of a hyperbolic tree. However, we have discovered that strict hierarchical tree structures are too limited. For many practical applications, we need to generalize a hyperbolic tree to a hyperbolic space. This approach results in massive cross-links in a highly connected graph that clutter the display. To resolve this problem...

متن کامل

ProbMap - A probabilistic approach for mapping large document collections

The visualization of large text databases and document collections is an important step towards more exible and interactive types of information access and retrieval. This paper presents a probabilistic approach which combines a statistical, model{ based analysis of a given set of document with a topological visualization principle. Our method can be utilized to derive topic maps, which represe...

متن کامل

Interactive Analysis of Space Frame Raft Soil System

This study presents a new approach for physical and material modeling of space frame-raft-soil system. The physical modeling consists of a modified Thimoshenko beam bending element with six degrees of freedom per node to model the beams and columns of the superstructure, a modified Mindlin's plate bending element with five degrees of freedom per node to represent the structural slabs and raft, ...

متن کامل

A Human-Centered Computing Framework to Enable Personalized News Video Recommendation

In this chapter, an interactive framework is developed to enable personalized news video recommendation and allow news seekers to access large-scale news videos more effectively. First, multiple information sources (audio, video and closed captions) are seamlessly integrated and synchronized to achieve more reliable news topic detection, and the inter-topic contextual relationships are extracte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003